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Recently Mirsky and Farahat proposed the problem of characterizing the class of doubly 
stochastic matrices for which the least number of permutation matrices necessary to repre- 
sent it as a convex sum has a prescribed value. It is shown that this number can be related 
to the number of eigenvalues of modulus one. The problem of similarity of doubly stochastic 
matrices is also treated. Finally, the question of transitivity of powers of sets of functions 
on the first n positive integers into itself is treated by defining a corresponding incidence 
matrix and examining its powers. 



I. Introduction 



In 1946 [1] 3 BirkhofT proved that any doubly 
stochastic (d.s.) matrix with non-negative entries is 
a convex combination of permutation matrices. We 
recall that a matrix is d.s. if every row and column 
sum is 1. Let tt n denote the polyhedron of non- 
negative d.s. matrices and as noted in [5] dim Q n 
= (n—\) 2 and hence any A& n can be written as a 
convex combination of at most (n— 1) 2 + 1 permuta- 
tion matrices. Mirsky and Farahat in a recent paper 
[6] suggested an investigation of the minimum num- 
ber P(A) of permutation matrices necessary to repre- 
sent Ael2 n as a convex combination. In section 2 we 
obtain an inequality relating fi(A) to h(A), the num- 
ber of characteristic roots of A of absolute value 1. 
In section 2 we also study a problem of the similarity 
of two matrices in £l n . 

In 3 we discuss conditions on A>0 that are implied 

n 

by the equality det (I— A) = 11 (l—a u ). 

i=l 

In 4 we obtain an inequality for the difference 
between the maximal characteristic roots of two 
matrices A and B satisfying B>A>0, where B>A 
means 6 lj; >a ti for all i,j. In certain cases our ine- 
quality yields better bounds than those obtained by 
applying the results in [2] or [7]. 

In 5 we discuss the question of transitivity of sets 
of functions on the set of integers {1, . . ., n] into 
itself by matrix methods. 



2. Doubly Stochastic Matrices 

A precise statement of the result relating h{A) 
and P(A) is contained in theorem 3. We prove two 
preliminary results of some interest in themselves. 
Recall that A>0 is reducible if there exists a per- 
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mutation matrix P such that 



PAP'-- 



Ai o 

A 2 \ A 2 



(i) 



where A n and A 22 are square matrices. It is clear 
that if Aett n then (1) implies that A 2l = Q. It follows 
that in case Aeti n a permutation matrix P may be 
chosen so that PAP' is the direct sum of irreducible 
d.s. matrices and 1 -square matrices. 

Theorem 1. If SeQ n and S is irreducible , then h(S) 
is a divisor of n. 

Proof. By the classical Perron-Frobenius 
theorem, more recently reproved in [8], there exists 
a permutation matrix P such that 



PSP'- 



ro, 


& 


. 


. . ^ 




2 


So . 


. . 
S h -i 


^Sh 







. . o* J 



(2) 



where f is a principal square matrix of zeros and 
h(S) = h. Let r(X) (c(X)) denote the number of 
rows (columns) of X. Clearly since Seti n each S< is 
d.s. and r (S x ) is the sum of the elements of S t as is 
c(S t ). Hence, 

r(iS < )=c(SO=c(0 <+ i)=r(0 <+ i)=r(Sf <+1 )==c(flf <+ i). (3) 

Thus every S t is of the same order and h(S) 
is a divisor of n . We next obtain a relation between 



0(S) and 0(S t ) where S=S { + 
denotes the direct sum. 

Theorem 2. If S r i €i2 Wi , *=1, 

and S=Si - . . . +S m , then 



(KS)<T;KS t )-m+i. 

i = l 



+ S m and + 



m, 



i = l 



(4) 
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if 



Proof. First we prove by induction onr+s that 



A=Tj 0j> t , 



i=l 






o<0i<0 2 < . . . <o r , o<^<^ 2 . . . <<p s , S ^=i=S ^ 

P i} Qj are permutation matrices, i=l, . . ., r, J— 1, 

. . ., s, then ^4+^S is in the convex hull of no more 
than r+s—1 permutation matrices of the form 

Pi+Qj. When r+s=2, then A=P U B=Q U and 

A -j- 5 = Pi + Qi . Suppose for convenience that fa < fa . 
Then 

A+B=e 1 (P 1 +Qd+(i-e l )^(± T ^Px^ 

Hr« + M*)} (5) 



and S , ^ =lj and 

; = 2 1— C/l 



1 



l ;=2 1 



<Py 



fiH 



^j" 



1-01 



= 1. 



Thus A+B=d 1 (P 1 + Q 1 ) + (l-6 1 ) R where R& n is 
the direct sum of two d.s. matrices, the first in- 
volving r— 1 permutation matrices and the second 
involving s permutation matrices. We apply the 
induction to R to conclude that f$(R) <r-\-s— 2. 

Hence f3(A-\-B)<r-\-s—l. Then an obvious in- 
duction on m yields 

<fi(8i+.. .+S m -i)+p(S n )-l 

m— l 

<T,P(S i )-m+2+f}(S m )-l 



Theorem 3. // Sett n and S is irreducible, then 



P(S)<h 



(H 



+1. 



(6) 



where h=h(S). 

Proof. We first observe that if P and Q are 
permutation matrices then fi(PSQ) = P(S) for any 
Se2 n . By theorem 1 let n=qh where q is an integer 
and let R be the permutation matrix with 1 in 
positions 

(q+i, i) mod n. 
Then from (2) we easily compute tf at 



PSP'R = S l + . . . +&_!+&, (7) 

and hence by theorem 2 

/3(PSP'R)=(3(S)< 2 0(St)-h+l. 

But by the Birkhoff result quoted in the introduction 
we have that 



/W<(<Z-D 2 +1. 
Hence, 

P(S)<h[(z-l) 2 +l]-h+l=hl 



0-0" 



+1. 



Actually the above techniques may be applied to 
yield better estimates on fi(S) than that given by a 
direct application of (6). There are several alter- 
natives for each of the matrices S t in (7): S t is 
reducible; S t is irreducible with h(Si)p>l; S t is irre- 
ducible with h(Si) = l; i.e., S t is primitive. These 
situations are covered by the following two theorems. 

Theorem 4. If SeQ n and P is such a permutation 

matrix that PSP ; = Ti + . • • +T k where each Tj is 
an n r square matrix either irreducible or with n j =l > 
then 

i)Vt,)+i. 



« S)s sGw 



Theorem 5. // SeQ n and P and Q are such permu- 
tation matrices that 



PSQ=Si+ ...+8 m 



(8) 



where each Sj is an n r square matrix either primitive 
or with %=1, then 

m 

r 3(S)<T J (n j -iy+l. 

3 = 1 

We describe the procedure for obtaining the form 
(8) for S: First reduce S to the direct sum of irre- 
ducible or 1 -square d.s. matrices. Each of these 
may then be reduced to the form (2) which in turn 
is reduced to a direct sum of irreducible or 1-square 
matrices. The process is repeated until every matrix 
which appears is either 1-square or primitive. We 
illustrate this with the following example: Let SeQ B 
be given by 

/0 4 Ja\ 

\A o 4 y 

t and I t are 2 -square zero and 



S 



/0 2 I2 
where -4=1 

V2 2 

identity matrices respectively, and J t eto t is the 
^-square matrix all of whose entries are l/t. We 
verify directly that no coordinate subspace is held 
invariant by S and hence S is irreducible. We first 

note that S 2 =J 4 +Ji and hence the characteristic 
roots of S 2 are 1 and 0. Hence the characteristic 
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roots of S are 1, — 1, and with appropriate multi- 
plicities. On the other hand, 1 is simple by the 
Perron-Frobenius result and tr(£) = 0. Hence the 
characteristic roots of S must be 1, — 1, and (six 
times) and h(S) = 2. Thus S is already in the form 
(2) and we bring S by a permutation to the form 

Jt+A and by a further one-sided permutation to 
Thus by theorem 2, 

]8(S)<i8(J4)+|8(/i)+i8(/i)+/3(J2)-4 + l. (9) 

A direct application of theorem 3 to J 4 yields I3(J A ) 
<10 and (9) implies that 



0(5) < 11. 



(10) 



Of course it is obvious by inspection that /3(e/ 4 )=4 
and hence (9) gives P(S)<5. Note that a direct 
application of theorem 3 yields ($(S)<19 and the 
bound given by the dimension of ft 8 is fi(S)<50. 
As a matter of fact, (3(S) =4. 

Recently one of the present authors and A. J. 
Hoffman independently proved that if Sefi n then 

n 

there exists a permutation a such that IT s i(T ^)>l/n n . 

It follows that the permanent of S satisfies the 
following inequality: 

per (S) = S H s W) >l/n n 

a 1=1 



for Sett n . Since 



0GS) 



/3GS) 



where P t are permutation matrices and c*i>0, XJ a % 

i=l 

= 1, we see that 

per(S)>(l//3(S)) w - 1 . 

It follows that we may apply the estimates of 
theorems 3, 4, 5 to obtain a lower bound for S that 
in some cases is better than l/n n . These results are 
motivated by a conjecture of v. der Waerden [4] 
concerning the minimum value of the permanent of 
a matrix in Q n . 

We next discuss circumstances under which two 
matrices in 12 n are similar. If Sett n and XSX~ 1 eQ n 
it of course is false that either X or X~ l must be d.s. ; 
e.g., take S=I n . However if the root 1 of S is 
simple then we have the following result. 

Theorem 6. If Sett n and S is irreducible and 
XSX~ l =R€tt n , then X is a multiple of a d.s. matrix. 
Moreover, there exists YeQ, n such that YSY~ 1 =R. 

Proof. Let J be the n-square matrix all of 
whose entries are 1, and let <Ji(X) be the sum of the 
entries in the -ith row of X. Then XS=RX implies 
that XSJ=RXJ, XJ=R(XJ). Each column of 
XJ is equal to the n-tuple (<ti(X), . . . , <r n (X)) 
=u(X). Then Ru(X)=u(X) and since 1 is 
a simple characteristic root of S and R (S is irre- 
ducible), u(X) must be a scalar multiple of the 



vector e all of whose coordinates are 1. Thus 
<Ti(X) = a, i=l, . . . , n, and by a similar argu- 
ment we easily see that <r t (X') = p, i=l, . . ., 
n. But then a=f$ and X is a scalar multiple of 
a d.s. matrix. Now X commutes with J and thus 
the characteristic roots of X-\-kJ are \i~\-kn, X 2 , 
. . . , X n , where X* are the roots of X and Xi cor- 
responds to the characteristic vector e. Choose 
k so that \i+kn?*0 and X+&J>0, and let 
Y =(<x J rnk)~ 1 {X-\-kJ). This completes the proof. 

3. Triangular Non-Negative Matrices 

Theorem 7. Let A>0 with maximal non-nega- 
tive characteristic root r<\. Then a necessary 
and sufficient condition that there exists a permutation 
matrix P such that PAP' is triangular is 



n (l-a„)=det (I n -A). 
Proof. We prove first that if 



(ID 



f(t)=H (t-a H )-det {tl n - A), (12) 

i=l 

then J(t)>0 for t>r. In this argument and 
subsequently we use the classical result that states 
that if a; is the dominant non-negative characteristic 
root of a principal submatrix of A, then a < r and 
if A is irreducible then the inequality is strict. 
Hence, /(r^n^i (r— a«) — det (rI n —A)>0 and the 
inequality is strict if A is irreducible. To prove 
that j(t) > we proceed by induction on n. Now 
differentiating (12) we have 

f(t)=ibf n (t- ajj )-det (tI n - x -A(i\i)) \, (13) 

where A (i\i) is the principal submatrix of A obtained 
by omitting the ith. row and column. For each i 
the summand in (13) is non-negative by the in- 
duction hypothesis (since t> r and r is not smaller 
than the dominant non-negative characteristic root 
of A{i\i)). Thus f(t)>0 and /(r)^0 so that 
/ (1)>0. Moreover if A were irreducible it would 
follow that/(l) >/(r)=H2 1 (r-a«)>0. 

We complete the proof as follows. Choose a 
permutation matrix P such that PAP' has the 
following sequence of blocks down the main diagonal. 

B x +. . .+B m =B, 

where each B t is 1 -square or possibly n r square irre- 
ducible. We show that the second alternative 
is impossible and it follows that PAP' is triangular. 
Clearly (11) implies that 



n (i- a «)=n (l-6„) = det (I n -A) = det {I n —B) 



= 11 det (I n -B k ). 



fc=i 
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Suppose some B k =C is irreducible. Then by the 
above argument 

TL(l-c it )>6Bt(I u -BJ, 

i=l K 

and we would conclude that 

m n 

ndet(/ n -B,)<n (l-a„), 



a contradiction. This completes the proof. 

4. Inequalities for the Dominant 
Characteristic Root 

Let E (n) denote the unit n-simplex. If A> is 
irreducible, let a be the dominant positive root and 
let 0<CxeE {n) be the unique characteristic vector of A 
in E in) corresponding to a. Suppose R>0, R^O 
commutes with A. Then Ax = ax, RAx = aRx, 
A(Rx/a(Rx)) = a(Rx/a(Rx)) where if y=(yi . . . y n ) 
then ^W = 2i!i?/i. But then the uniqueness of x 
implies that Rx/a(Rx)=x because (Rx/a(Rx))eE w . 

Theorem 8. Let B> A be two non-negative 
irreducible n-square matrices with dominant positive 
roots (3 and a respectively. Let M=max (by—at/), 

ij 

m=min {ba—a^) and suppose that R is a non- 

negative matrix which commutes with either A or B 
and possesses no zero columns. Then 



m 



-<$-a<- 



M 



max max- 



mm mm 



ru 



J z>* 



k=1 



j J2r kj 



(14) 



k = l 



Proof. Let yeE (n) , ?/>0, satisfy Ay = ay and 
let n=(By) i /y i =m&x(By) k /y k > (3. Then we assert 

that /3-a<M/y t : for (B-A)y+Ay=By, 
(B-A)y+ay=By, 



^(b ik —a ik )y k +ay i =(By) i , 

k=l 

1 n ( Bv) 

±T,(b ik -a ik )y k +a=^=»>0. 

y%k=i yi 

n 

Now yeE (n) ) hence ^>2(b ik — a ik )y k <M, and 

k=l 

j3—a<M/yi<M/mm y k . 

k 

From the above discussion we know that y=Ry/ 
<r(Ry). Consider the function 



2—1 r kj X j 



2-J Z-j r iJ X J 
j=\ i=l 



XeE { 



(15) 



minimizing the right-hand side of (15) over all 
xeE (n) yields 



2/*>min — 



1 Sr« 



Thus 



/3—a<M/min y k <M /min min — 

k k t 



r kt 



5>„ 

i = \ 



A similar computation establishes the lower bound 
in (14). 

Note that R is any non-negative matrix commuting 
with A or B) e.g., any non-negative polynomial in A 
with no zero columns. We also remark that in prac- 
tice we compute each column sum of R, divide it into 
the least element in the column, and take the least 
such quotient to obtain the denominator of the 
upper bound in (14). Similarly for the lower bound. 

5. A Matrix Condition for Transitivity 

Let Ij n denote the set of integers {1, . . ., n] and 
suppose <T], . . ., a k are functions mapping L n into 
itself. We define 31* to be the set of functions of 
the form a^a^ . . . a ik , l<ij<k. We say that 
% v is transitive for some p if for any given s and t 
in L n there exists </>e2l p such that <j>{s)=t. We define 
an 7i-square incidence matrix A associated with 91 1 
as follows: For each <r t we define Q t as the matrix 
whose (s, t) entry is 1 if o- 4 ($)=£, otherwise; then 

let A=t S*- i Qi- 1^ i s clear that A > and every row 

sum of A is 1 ; i.e., A is row stochastic. We remark 
that if T n denotes the totality of non-negative row 
stochastic 77-square matrices, then an easy argument 
[6] thows that T n is the convex hull of the set of all 
matrices with exactly one 1 in each row. 

In general let A correspond to <r u . . ., <j k and B 
to 0i, . . ., 4> m . Then the (i,j) entry of AB is posi- 
tive if and only if the integer i is mapped by some 
a a into p and p is mapped by some <j>p into j. That 
is, <pp(Ta(i)=j. Thus we have 

Theorem 9. A necessary and sufficient condition 
that §I P be transitive is that A p >0. 

We easily obtain two corollaries to this result. 

Corollary 1. Let 2li={<7i, . . ., <r k } be a set oj 
junctions on L n —{\, . . ., n} into itself satisfying 

(a) at least one a t has a fixed point, 

(b) no proper subset of L n is left invariant (as a 
set) by every a^U. Then % p is transitive forp >2n 
-2. 
Proof. Condition (b) implies that the incidence 

matrix A is irreducible. Condition (a) implies that ,. j 
tr(^4)>0 and hence h(A) = l. Hence A is primitive 
and the integer 2n— 2 follows from a result of 
Holladay and Varga [3]. 
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Corollary 2. Let 2li={ci, . . ., o k } be a set of 
permutations of the integers {1, . . .,n}=L n satisfying 

(a) there exists an integer s in L n and a pair of 
permutations <t { and aj such that a j a i (s)=s, 

(b) no proper subset of L n is left invariant (as a 
set) by every cr i€ 2Ii ? 

(c) n is odd. 

Then % p is transitive for p>4n—4. 

Proof. If there exists a <r m and an integer v such 
that <r m (v)=v then we have the conditions of corol- 
lary 1 and Sip is transitive for p>2n— 2 without the 
condition (c). If not, let A be the incidence matrix 
and then tr(^l)=0, but the sum of all the second 
order principal subdeterminants of A is not zero. 
Hence h(A) is 1 or 2. But since Ae£l n we apply 
theorem 1 to conclude that h(A) divides n. Thus 
h(A) = l and A is primitive. Now A 2 has a positive 
element on the main diagonal and h(A 2 ) = l. Thus 
A 2 is also irreducible and thus primitive so that 
applying the result in [3] again we have (v4 2 ) 2n ~€>0, 
completing the proof. 

Note that condition (c) is essential as shown by 
the following example : 2li= {<r u <r 2 } where <ri = (l 2 3 4), 
<r 2 =(l 4 3 2) satisfies conditions (a) and (b) of corol- 
lary 2. However, 2l P is not transitive for any p. 
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